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(54i Novel process for constructing a cDNA library and a novel polypeptide and DNA coding the same. 



(57i A process for constructing a cDNA library 
which has a selectivity for signal peptides, that 
makes it possible to efficiently identify unknown 
and useful polypeptide comprising a signal pep- 
tide. A novel polypeptide consisting of 89 amino 
acids (including a signal peptide) produced by a 
stroma cell line, which is useful as an agent for 
preventing or treating, for example, anemia, 
leukopenia or infections and the like, and DNAs 
coding for said polypeptide, have been iden- 
tified using the process of the invention. 
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This invention relates to a process for constructing a cDNA library, and a novel polypeptide and a DNA 
coding for the same. More particularly, it relates to an efficient process for constructing a cDNA library having 
a high selectivity for stgnal peptides, and a novel polypeptide produced by a specific stroma cell line and a 
DNA coding for said polypeptide. 

In order to obtain specific polypeptides (for example, proliferation and/or differentiation factors) or a DNA 
coding for the same, there have been generally employed methods comprising confirming the target biological 
activity in a tissue or a cell culture medium and then cloning of a gene through the isolation and purification 
of a polypeptide and further methods comprising expression-cloning of a gene with the guidance of the bio- 
logical activity. 

However, it is frequently observed that a gene, which has been cloned with the guidance of a certain ac- 
tivity, codes for a known polypeptide since many physiologically active polypeptides occurring in vivo have 
various biological activities. Further, most intravital physiologically active polypeptides are secreted only in a 
trace amount which makes the isolation and purification thereof and the confirmation of biological activity 
difficult. 

Recent rapid developments in techniques for constructing cDNAs and sequencing techniques have made 
it possible to quickly sequence a large amount of cDNAs. By utilising these techniques, a process, which com- 
prises constructing cDNA libraries from various cells and tissues, cloning cDNAs at random, identifying the 
nucleotide sequences thereof, expressing the corresponding polypeptide and then analyzing its physiological 
functions, is now in use. Although this process is advantageous in that a gene can be cloned and information 
on its nucleotide sequence can be obtained without any biochemical or genetic analysis, the target gene can 
often only be identified by chance. 

The present inventors have studied the cloning of genes for proliferation and differentiation factors in he- 
matopoietic and immune systems. They have paid attention to the fact that most secretory proteins such as 
proliferation and/or differentiation factors (for example various cytokines) and membrane proteins such as re- 
ceptors thereof (hereinafter these proteins will be referred to generally as secretory proteins and the like) have 
sequences called signal peptides in the N-termini. Extensive studies have been conducted to provide a process 
for efficiently and selectively cloning genes coding for signal peptides. As a result, a process has now been 
devised whereby an N-terminal fragment can be efficiently amplified and the existence of a signal peptide 
can be easily examined. 

In accordance with the present invention, cDNAs with a high probability of containing a signal peptide are 
ligated at both their ends to linkers containing restriction enzyme sites which are different from each other. 
These fragments alone are rapidly produced in a large amount by the polymerase chain reaction (PCR) method 
so as to elevate the content of the fragments with a high probability of containing a signal peptide. Next, the 
above-mentioned fragment is introduced into an expression vector containing DNA coding for a known secre- 
tory protein or the like but lacking DNA encoding the corresponding signal peptide. Many secretory proteins 
and the like are secreted or expressed on cell membrane even if the signal peptide has been substituted by 
a signal peptide of another protein. Therefore, if the known secretory protein or the like is expressed on cell 
membrane or outside the cells, this confirms that a cDNAfragment corresponding to a stgnal peptide has been 
introduced into the expression vector. Thus the inventors have devised a convenient method for detection of 
the insertion of cDNA encoding a signal peptide. 

The polymerase chain reaction used in the present invention is known as a method for amplifying specific 
DNA fragments in a large amount. It is also known that many secretory proteins and the like can be expressed 
even if the signal peptide thereof is substituted by that of another secretory protein or the like. However, to 
the applicants' knowledge, there has been no suggestion that these techniques be used as the inventors have 
done to provide a process for selectively cloning a signal peptide. 

It is known that hematopoietic cells secrete various proliferation and/or differentiation factors exemplified 
by interleukin. 

The present invention further relates to a novel polypeptide obtained from hematopoietic cells and DNA 
coding for the same. 

The inventors have sought a novel factor (polypeptide) produced by hematopoietic cells using the process 
which is the first subject of the present invention. As a result, a novel polypeptide and DNA coding for the same 
have been identified. 

Computer searches to identify polypeptides having sequences identical or highly homologous with that 
of the polypeptide of the present invention and the DNAs coding for the same have not identified any such 
sequences. Thus, to the best of the applicants' knowledge, the polypeptide of the present invention and the 
DNA coding for the same are novel. Further, homology analysis has revealed that the polypeptide of the present 
invention is a member of the chemokine family as it has a pattern of Cys-X-Cys (X is optional amino acid). 

Accordingly, the present invention provides a process for constructing a cDNA library comprising: 
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(1 ) synthesizing single-stranded DNA complementary to mRNA isolated from subject cells using a random 
primer and ligating oligo dT to the 3 -end of the single-stranded DNA; 

(2) converting the single-stranded DNA obtained in (1 ) to a double-stranded DNA using as primer a poly 
A oligomer ligated to a first restriction enzyme (enzyme I) site; 

5 (3) fragmenting the double-stranded DNA obtained in (2), fractionating the fragments obtained by size, 

ligating linker containing a second restriction enzyme (enzyme II) site, differing from the enzyme I, thereto 
and fractionating; 

(4) amplifying fragments containing the sites of restriction enzymes I and It by a polymerase chain reaction 
using a first primer containing the enzyme I site and a second primer containing the enzyme II site, di- 

w gesting the cDNA thus amplified with the restriction enzymes, enzyme I and enzyme II and fractionating; 

and 

(5) ligating the cDNA fragment thus obtained upstream of a gene coding for a secretory protein or mem- 
brane protein having deleted therefrom the sequence coding for a signal peptide, integrating the ligated 
DNA into a eucaryotic cell expression plasmid vector, and transforming the vector into a host. 

15 The invention further provides a polypeptide having the amino acid shown in SEQ ID. No. 1 , in substantially 

purified form, a homologue thereof or a fragment of the sequence or homologue of a fragment, and DNA en- 
coding such a polypeptide. The polypeptide having the sequence shown in SEQ ID No. 1 has been identified 
using the process of the invention. 

Fig. 1 is a conceptional view of the process for constructing a cDNA library according to the present in- 

20 vention. 

Fig. 2 is a flow chart for the construction of a plasmid vector pcDL-SRa. 

Fig. 3 is a conceptional view of the process for constructing an EcoRI-SacI fragment of hG-CSF. 

Fig. 4 is a conceptional view of the process for constructing an Sacl-Kpnl fragment of hTac cDNA. 

Fig. 5 is a flow chart for the construction of pSGT and pSRT 
25 Fig. 6 is a conceptional view of the process for constructing an EcoRI-SacI fragment of hRARct. 

Fig. 7 is an FACS histogram showing the expression of a fusion protein hG-CSF-hTac on membrane. 

Fig. 8 is an FACS histogram showing the expression of a fusion protein hRAR-hTac on membrane. 

Fig. 9 is a conceptional view of the first half of the process for constructing the cDNA library of the Example. 

Fig. 10 is a conceptional view of the second half of the process or constructing the cDNA library of the 
30 Example. 

Fig. 11 is a hydrophobicity profile of (a part of) the polypeptide according to the present invention. 
The first subject of the present invention is concerned with a process for efficiently constructing a cDNA 
library of signal peptides. 

The process for constructing a cDNA library of signal peptides according to the present invention com- 
35 prises the following steps: 

(1 ) synthesizing a single-stranded DNA f rom mRNA isolated from the subject cells with the use of a random 
primer and adding oligo dT to the 3'-end of the single-stranded DNA thus obtained; 

(2) synthesizing a double-stranded DNA from the single-stranded DNA obtained in (1) using as a primer 
a poly A oligomer ligated to a specific restriction enzyme (enzyme I) site; 

40 (3) fragmenting the double-stranded DNA obtained in (2), fractionating the fragments obtained by size, 

ligating linker containing a specific restriction enzyme (enzyme II) site, differing from the enzyme I, thereto 
and fractionating again; 

(4) performing a PCR using a first primer containing the enzyme I site and a second primer containing 
the enzyme II site, digesting the cDNA thus amplified with enzyme I and enzyme II and fractionating; and 
45 (5) ligating the cDNA fragment upstream of the gene for a known secretory protein or membrane protein 

with the deletion of signal peptide and integrating the ligated DNA into an eucaryotic cell expression plas- 
mid vector, followed by transformation. 
Fig. 1 is a conceptional view of the above-mentioned steps, 
Now each of these steps will be illustrated in more detail. 
so In the step (1 ), if required, the subject cells are stimulated with an appropriate stimulating agent, and then 

the mRNA is isolated in accordance with known methods as described for example by Okayama H. et aL Meth- 
ods in Enzymology, 154 , 3 (1987). 

As the subject cells, any cells may be used, so long as they have a possibility of producing a secretory 
protein or the like. For example, nerve cells and hematopoietic cells may be cited therefor. A single-stranded 
55 cDNA can be synthesized with the use of a random primer by methods known per se. A marketed random pri- 
mer is available therefor. Subsequently, oligo dT is added to the 3'-end of the single-stranded cDNA by using 
a terminal deoxytransferase. 

In the step (2), a double-stranded cDNA is synthesized by methods known perse. Any restriction enzyme 
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sites may be used as the restriction enzyme (enzyme I) site to be ligated to a poly A oligomer serving as a 
primer and the restriction enzyme (enzyme li) site to be used in the next step (3). so long as they differ from 
each other. It is preferable to use EcoRI and Sacl, as enzyme I and enzyme II respectively. 

In the step (3). the double-stranded DNA is fragmented for example by ultrasonication so as to give an 

5 average cDNA length of 300 bp and the obtained fragments are fractionated into cDNAs of 200 to 500 bp by 
agarose gel electrophoresis (AGE). After blunting with T4DNA polymerase, enzyme II is ligated and the cDNAs 
are fractionated into DNAs of 200 to 500 bp again by agarose gel electrophoresis. As described above, any 
enzyme may be used as the enzyme II, so long as it differs from the enzyme I. The procedure in this step in- 
creases the likelihood that a cDNA fragment containing a signal peptide exists in the part located between 

10 the enzymes I and II. 

In the step (4), PCR is carried out in order to further elevate the possibility that a cDNA fragment containing 
a signal peptide exists in the part located between the enzymes I and II. PCR is a well known technique and 
automated devices therefore are commercially available. It is sufficient to amplify 25 to 30 times. The cDNA 
thus amplified is digested with enzyme I and enzyme II and electrophoresed on an agarose gel to thereby frac- 

15 tionate into cDNAs of 200 to 500 bp. 

In the step (5), a gene for a known secretory protein or the like with the deletion of signal peptide, which 
is called a reporter gene, and a cDNA fragment obtained in the above (4) are integrated into an eucaryotic cell 
expression plasmid vector in such a manner that the cDNA fragment is located upstream of the reporter gene. 
This is followed by transformation of the vector into a host. 

20 Various known eucaryotic cell expression plasmid vectors for example, pcDL-SRa and pcEV-4 which are 

capable of acting in Escherichia coli, are usable in the present invention. 

As the reporter gene, genes for mature protein parts of soluble secretory proteins and membrane proteins 
of any type are usable. The expression of these reporter genes may be confirmed by known methods such 
as antibody methods. Human IL-2 receptor u gene is especially suitable as a reporter gene. 

25 A number of E. coli strains are known as hosts for transformation and any of these stains is usable. It is 

preferable to use DH5 competent cells [described in Gene. 96, 23 (1990)] therefor. Transformants may be in- 
cubated in a conventional manner and thus the cDNA library of the present invention can be obtained. 

In the process for constructing a cDNA library according to the present invention, there is a high possibility 
that gene fragments coding for signal peptides are contained in the library. However, not every clone contains 

30 said fragment. Further, not all of the gene fragments code for unknown (novel) signal peptides. It is therefore 
necessary to screen a gene fragment coding for an unknown signal peptide from said library. 

Namely, the cDNAIibrary is divided into small pools of an appropriate size and integrated into an expression 
system. Examples of the expression system for producing a polypeptide include mammalian cells (for example, 
monkey COS-7 cells, Chinese hamster CHO cells, mouse L cells, etc.). Transfection may be performed in ac- 

35 cordance with well known methods such as the DEAE-dextran method. After the completion of the incubation, 
the expression of the reporter gene is examined. It is known that a reporter gene would be expressed even 
though the signal peptide is the one characteristic to another secretory protein. That is to say, the fact that the 
reporter gene has been expressed indicates that a signal peptide of some secretory protein has been integrated 
into the library. Positive pools are further divided into smaller ones and the expression and the judgement are 

40 repeated until a single clone is obtained. The expression of the reporter gene can be judged by, for example, 
fluorescence-labeled antibody assay, enzyme-linked immunosorbent assay (ELISA) or radioimmunoassay 
(RIA), depending on kinds of the employed reporter gene. 

Next, the nucleotide sequence of the isolated positive clone is determined. In the case of a cDNA which 
is proved to code for an unknown protein, the clone of the full length is isolated with the use of the cDNA as a 

*5 probe and the full nucleotide sequence can be thus identified. All of these operations are carried out by meth- 
ods which are well known by those skilled in the art. For example, the nucleotide sequence may be identified 
by the Maxam-Gilbert method or the dideoxy terminator method. On the other hand, the full length may be 
sequenced in accordance with a method described in Molecular Cloning [Sambrook. J.. Fritsch. E.F. and Man- 
iatis, T., published by Cold Spring Harbor Laboratory Press in 1989]. 

50 The present invention further relates to a novel polypeptide which has been identified using the process 

of the present invention to construct a cDNA library and a DNA coding for the same. In particular, it relates to; 

(1) a polypeptide consisting of an amino acid sequence represented by SEQ ID No. 1; 

(2) a DNA coding for the polypeptide described in the above (1 ); 

(3) a DNA having a nucleotide sequence represented by SEQ ID No. 2; and 
; 5 (4) a DNA having a nucleotide sequence represented by SEQ ID No. 3. 

A polypeptide of Seq. ID No. 1 in substantially purified form will generally comprise the polypeptide in a 
preparation in which more than 90%. eg. 95%. 98% or 99% of the polypeptide in the preparation is that of the 
Seq. ID No. 1. 
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A polypeptide homologue of the Seq. ID No. 1 will be generally at least 70%, preferably at least 80 or 90% 
and more preferably at least 95% homologous to the polypeptide of Seq. ID No. 1 over a region of at least 20, 
preferably at least 30, for instance 40, 60 or 100 more contiguous amino acids. Such polypeptide homologues 
will be referred to below as a polypeptide according to the invention. 

5 Generally, fragments of Seq. ID No. 1 or its homologues will be at lease 10, preferably at least 1 5, for ex- 

ample 20, 25, 30, 40, 50 or 60 amino acids in length, and are also encompassed by the term "a polypeptide 
according to the invention" as used herein, particular fragments of the polypeptides of the invention are frag- 
ments of which include amino acid residues numbered 1-70 in Seq ID No. 4 or a homologue thereof. 

A DNA capable of selectively hybridizing to the DNA of Seq. ID No. 2 or 3 will be generally at least 70%, 

w preferably at least 80 or 90% and more preferably at least 95% homologous to the DNA of Seq. ID No. 2 or 3 
over a region of at least 20, preferably at least 30, for instance 40, 60 or 100 or more contiguous nucleotides. 
Such DNA will be encompassed by the term "DNA according to the invention". 

Particular DNA capable of selectively hybridising to the DNAof SEQ ID No. 2 or 3 is the nucleotide residues 
numbered 139-348 in SEQ ID No. 4 or a fragment thereof. 

15 DNA according to the invention may be used to produce a primer, eg. a PCR primer, a probe eg. labelled 

by conventional means using radioactive or non-radioactive labels, or the DNA may be cloned into a vector. 
Such primers, probes and other fragments of the DNAof Seq. ID No. 2 or 3 will be at least 15, preferably at 
least 20, for example 25, 30 or 40 nucleotides in length, and are also encompassed by the term "DNA according 
to the invention" as used herein. 

20 DNA according to the invention may be produced recombinantly, synthetically, or by any means available 

to those of skill in the art. 

A further embodiment of the invention provides replication and expression vectors comprising DNA ac- 
cording to the invention. The vectors may be, for example, plasmid, virus or phage vectors provided with an 
origin of replication, optionally a promoter for the expression of the said DNA and optionally a regulator of the 

25 promotor. The vector may contain one or more selectable marker genes, for example an ampicillin resistance 
gene. The vector may be used in vitro , for example for the production of RNA corresponding to the DNA, or 
used to transform a host cell. 

A further embodiment of the invention provides host cells transformed or transfected with the vectors for 
the replication and expression of DNA according to the invention, including the DNA Seq. ID No. 2 or 3 or the 

30 open reading frame thereof. The cells will be chosen to be compatible with the vector and may for example 
be bacterial, yeast, insect or mammalian. 

DNA according to the invention may also be inserted into the vectors described above in an antisense 
orientation in order to provide for the production of antisense RNA (DNA). Antisense RNA (DNA) may also be 
produced by synthetic means. Such antisense RNA (DNA) may be used in a method of controlling the levels 

35 of a polypeptide of the invention in a cell. 

A further embodiment of the invention provides a method of producing a polypeptide which comprises cul- 
turing host cells of the present invention under conditions effective to express a polypeptide of the invention. 
Preferably, in addition, such a method is carried out under conditions in which the polypeptide of the invention 
is expressed and then secreted from the host cells. 

40 The invention also provides monoclonal or polyclonal antibodies to a polypeptide according to the inven- 

tion. The invention further provides a process for the production of monoclonal or polyclonal antibodies to the 
polypeptides of the invention. Monoclonal antibodies may be prepared by conventional hybridoma technology 
using a polypeptide of the invention or a fragment thereof, as an immunogen. Polyclonal antibodies may also 
be prepared by conventional means which comprise inoculating a host animal, for example a rat or a rabbit, 

45 with a polypeptide of the invention and recovering immune serum. 

The present invention also provides pharmaceutical compositions containing a polypeptide of the inven- 
tion, or an antibody thereof, in association with a pharmaceutical^ acceptable diluent or carrier. 

The invention also provides a polypeptide according to the invention or an antibody for use in a method 
of therapy or diagnosis on a human or animal body. 

so The polypeptides of the present invention include not only those having the amino acid sequence repre- 

sented by the SEQ ID No. 1 but also those with partial deletion thereof (for example, a polypeptide consisting 
of the mature protein part alone, or consisting of a part of the mature protein essentially required for the ex- 
pression of the biological activity), those with partial replacement by other amino acid(s) (for example, a poly- 
peptide some of amino acids are replaced by those having similar properties) and those with partial addition 

55 or insertion of amino acid(s). 

It is well known that there are up to six different codons which may code for a single amino acid (for ex- 
ample, one type of codon for Met while six types of codon for Leu). Accordingly, the nucleotide sequence of 
the DNA can be changed without altering the amino acid sequence of the polypeptide. 

5 
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The DNA as specified in (2) includes all nucleotide sequences coding for the polypeptide represented by 
Seq. ID No. 1 . Changes in the nucleotide sequence sometimes bring about an increase in the polypeptide pro- 
ductivity. 

The DNA as specified in (3) is an embodiment of the DNA as specified in (2) and represents a natural 
5 sequence. 

The DNA as specified in (4) represents a sequence wherein a natural non-translational region is added 
to the DNA as specified in (3). 

Asignal peptide is a highly hydrophobic region located immediately downstream of the translation initiation 
amino acid Met. It is assumed that the signal peptide in the polypeptide of the present invention resides in a 
to region ranging from Met at the 1 -position to Ser at the 19-position in the amino acid sequence represented by 
Seq. ID No. 1. The region essentially responsible for the expression of the biological activity corresponds to 
the part of the amino acid sequence of the Seq. ID No. 1 lacking of the signal peptide, i.e., the mature protein 
part. Thus the signal peptide never relates to the activity. 

The DNA having the nucleotide sequence represented by Seq. ID No. 3 can be prepared in accordance 
15 with the process described as the first subject of the present invention. 

Once the nucleotide sequences represented by Seq. ID No. 2 and No. 3 are determined, the DNA of the 
present invention can be chemically synthesized. Alternatively, the DNA of the present invention can be ob- 
tained by chemically synthesizing fragments of said nucleotide sequence and hybridizing with the use of the 
fragments as a probe. Further, the target DNA can be produced in a desired amount by introducing a vector 
20 DNA containing said DNA into an appropriate host and then incubating the host. 

Examples of methods for obtaining the polypeptide of the present invention include: 

(1) isolation and purification from vital tissues or cultured cells; 

(2) chemical synthesis of peptides; and 

(3) production with the use of gene recombination techniques. From an industrial viewpoint, the method 
25 described in (3) is preferable. 

Examples of the expression system (host-vector system) for producing the polypeptide by using gene re- 
combination techniques include those of bacteria, yeasts, insect cells and mammalian cells. 

In order to express in E. coli, for example, an initiator codon (ATG) is added to the 5'-end of the DNAcoding 
for the mature protein region. The DNA thus obtained is then ligated to the downstream of an appropriate pro- 

30 moter (for example, trp promoter, lac promoter, >.p L promoter, T7 promoter, etc.) and inserted into a vector ca- 
pable of functioning in E. coli (for example, pBR322, pUC18, pUC19, etc.). thus constructing an expression 
vector. Next, an E. coli strain (for example, E. coli, DH1, E. coli JM109, E.coli HB101, etc.) transformed with 
this expression vector is incubated in an appropriate medium. Thus the target polypeptide can be obtained from 
the incubated cells. Alternately, a bacterial signal peptide (for example, a signal peptide of pelB) may be used 

35 and thus the polypeptide can be secreted into the periplasm. Furthermore, a fusion protein together with other 
polypeptide can be produced. 

Expression in mammalian cells can be effected, for example, in the following manner. Namely, a DNAcod- 
ing for the nucleotide sequence represented by Seq. ID No. 3 is inserted into the downstream of an appropriate 
promoter (for example, SV40 promoter. LTR promoter, metallothionein promoter, etc.) in an appropriate vector 

40 (for example, retrovirus vector, papilloma virus vector, vaccinia virus vector, SV40-series vector, etc.), thus 
constructing an expression vector. Next, appropriate mammalian cells (for example, monkey COS-7 cells, Chi- 
nese hamster CHO cells, mouse L cells, etc.) are transformed with the expression vector obtained above and 
the transformant is incubated in an appropriate medium. Thus the target polypeptide can be secreted into the 
culture medium. The polypeptide thus obtained can be isolated and purified by conventional biochemical meth- 

45 ods. 

By using the process for constructing a cDNA library which is the first subject of the present invention, a 
DNA coding for a signal peptide of a secretory protein or a membrane protein can be efficiently selected and, 
in its turn, an unknown and useful protein can be efficiently found out. The novel polypeptide which is the sec- 
ond subject of the present invention is produced and secreted from a stroma cell line. Therefore, the polypep- 

50 tide has biological activities relating to the survival and proliferation of hematopoietic stem cells and the pro- 
liferation and differentiation of B cells and myeloid cells, and chemoattractant activity of neurophil. Accord- 
ingly, the polypeptide of the present invention per se is usable as an agent for preventing or treating, for ex- 
ample, anemia or leukopenia, infections, etc. 

In addition, the above-mentioned polypeptide existing in vivo can be assayed by using a polyclonal anti- 

55 body or a monoclonal antibody for said polypeptide, which is applicable to studies on the relationship between 
said polypeptide and diseases or to the diagnosis of diseases and the like. The polyclonal antibody and the 
monoclonal antibody can be prepared by a conventional method with the use of said polypeptide or a fragment 
thereof as an antigen. 
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The DNA according to the present invention serves as an important and essential template in the produc- 
tion of the polypeptide of the present invention which is expected to be highly useful. Further, the DNA of the 
present invention is applicable to the diagnosis and treatment of hereditary diseases, i.e., gene therapy, and 
therapy with ceasing the expression of the polypeptide by using antisense DNA(RNA). Furthermore, a gen- 
5 omic DNA can be isolated by using the DNA of the present invention as a probe. Similarly, a human gene for 
a related polypeptide being highly homologous with the DNA of the present invention and a gene of an organism 
other than human for a polypeptide being high homologous with the polypeptide of the present invention can 
be isolated. 

10 Examples 

The following Examples and Reference Example are illustrated, but not limit the present invention. 
Reference Example 1 

15 

Construction and expression of plasmid pcDL-SRa-h-G-CSF-hTac (pSGT) and plasmid pcDL-SRa-hRARa- 
hTac (pSRT) 

A plasmid, wherein a cDNA coding for a fusion protein of hG-CSF (human granulocyte colony stimulating 
factor, a typical example of a protein having a signal peptide) or hRARa (human retinoic acid receptor a, a 
20 typical example of a protein having no signal peptide), with hTac (human IL-2 receptor a, used as a reporter 
gene) was integrated into an eucaryotic cell expression plasmid vector pcDL-SRa having an SRa promoter 
[described in Mol. Cell. Biol., 8, 466 (1988), provided by Dr. Yutaka Takebe, National Institute of Health], was 
constructed. After transformation, the expression of the reporter protein on the membrane was examined. 
(1) By employing a plasimd pSP72-hG-CSF, wherein hG-CSF cDNA had been integrated into the EcoRI 
25 site of a plasmid pSP72 (purchased from Promega), as a template and using an SP6 promoter primer (pur- 

chased from Takara Shuzo Co., Ltd.) and an hG-CSF specific primer having an Sad site added thereto, 

5 ' GGA GATATC GAGCTCC TCGGGGTGGCACAG 3 ' 

30 EcoRV t SacI 

hG-CSF cDNA antisense 

PCR was performed 25 cycles (at 95 °C for one minute, at 48 °C for two minutes and at 72 °C for two 
35 minutes). The amplified DNA fragment was digested with Sacl-EcoRI and once subcloned into a plasmid 

pBlue script SK(+) (pBS). After digesting with Sacl-EcoRI again, an EcoRI-SacI fragment of hG-CSF was 
obtained. On the other hand, a plasmid pBS-hTac, wherein hTac cDNA had been integrated into the Hindlll 
site of pBS, was digested with Sacl-Kpnl to thereby give an Sacl-Kpnl fragment of hTac cDNA with the 
deletion of the signal sequence. These fragments were integrated into the EcoRI-Kpnl site of pcDL-SRa 
40 with the deletion of stuffer (Fig. 2) to thereby give a plasmid pcDL-SRot-hG-CSF-hTac (pSGT) (Figs. 3, 4 

and 5). 

Next, by employing a plasmid pGEM3-hRARa, wherein hRARa cDNA had been integrated into the 
EcoRI site of a plasmid pGEM3, as a template and using an SP6 promoter primer and an hRARa specific 
primer having an SacI site added thereto, 

45 

5' GGA GATATC GAGCTCA ATGGTGGCTGGGGATG 3' 
EcoRV SacI 



50 hRARa cDNA antisense 

PCR was performed. Subsequently, the procedure employed in the above-mentioned case of G-CSF was re- 
peated to thereby give a plasmid pcDL-SRa-hRARa-hTac (pSRT) (Figs. 6, 4 and 5). 
55 (2) The pSGT or pSRT obtained in the above (1) was transfected into COS-7 cells by the DEAE-dextran 

method [described in detail in Current Protocol in Molecular Biology, chapter 9.2.1.]. After 48 hours, the 
cells were harvested from a dish and incubated together with mouse anti-Tac igG antibody for 20 minutes 
on ice. After eliminating free antibodies, the mixture was incubated together with goat anti-mouse igG an- 

7 
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tibody labeled with fluorescein isothiocyanate (FITC) for 20 minutes on ice. After eliminating free antibo- 
dies again, the expression of a fusion protein G-CSF-Tac or RARa-Tac on the membrane was examined 
with a fluorescence activated cell sorter (Model FACS Can, manufactured by BECTON DICKINSON, here- 
inafter referred to simply as FACS). Figs. 7 and 8 show the results of the judgement. 
5 As shown in Fig. 7, G-CSF-Tac was expressed on the membrane as well as Tac. As shown in Fig. 8, on 

the other hand, RARu-Tac was not detected on the membrane but remained within the cells. These results 
indicate that when a cDNA fragment containing a signal peptide is ligated to the upstream of a reporter gene, 
said reporter protein is expressed on the cell membrane, while when a cDNAfragment having no signal peptide 
is ligated, the reporter protein is not expressed. 

10 

Example 1 

Construction of cDNA library having selectivity for signal peptides (Figs. 9 and 10) 

Total RNA was extracted from a mouse stroma cell line ST2 [cells supporting the survival and proliferation 

15 of hematopoietic stem cells and the proliferation and differentiation of B cells and myeloid cells; refer to EMBO 
J., 7, 1337 (1988)] by the acid guanidine-phenol-chloroform (AGPC) method [described in detail in "Saibo Ko- 
gaku Jikken Protokoru (Protocol in Cellular Engineering Experiments)", published by Shujun-sha, 28 - 31]. 
Then poly A-RNA was purified by using oligo (dT)-latex (Oligotex-dT30®, marketed from Takara Shuzo Co., 
Ltd.). By using a random hexamer as a primer, a single-stranded cDNA was synthesized with reverse tran- 

20 scriptase and dT was added to the 3'-end thereof with the use of terminal deoxyt ransferase. A 1 7 mer dA ligated 
to a restriction enzyme site containing EcoRI 

5' GA TGCGGCCGC CTCGAG GAATTC (dA) 17 3' 
25 Notl Xhol EcoRI 

was annealed and a double-stranded cDNA was synthesized by using the same as a primer. Then the cDNA 
was fragmentated by ultrasonication so as to give an average length of 300 bp and the cDNAs of 200 to 500 
bp were fractionated by agarose gel electrophoresis. After blunting the ends with T4DNA polymerase, a lone 
30 linker containing an Sacl site 

5' GAGGTACAAGCTT GATATC GAGCTCGCGG 3' 
3 ' CATG TTCGAA CTATAG CTCGAG CGCC 5' 

35 Hind III CcoRV Sacl 

[see Nucleic Acids Res., 18, 4293 (1990)] was ligated and cDNAs of 200 to 500 bp were fractionated again by 
agarose gel electrophoresis. By using a primer (NLC) containing an EcoRi site 

40 

NLC 

5' GATGCGGCCGCCTCGAGGAATTC 3' 
45 and another primer (LLHES) containing an Sacl site 
LLHES 

5' GAGGTACAAGCTTGATATCGAGCTCGCGG 3' 

50 

PCR was performed 25 cycles (at 94 °C for one minute, at 50 °C for two minutes and at 72 U C for two minutes). 
The amplified cDNA was digested with Sacl and EcoRI and cDNAs of 250 to 500 bp were fractionated by agar- 
ose gel electrophoresis. The cDNA was ligated to a plasmid obtained by digesting pSRT (prepared in Reference 
Example 1) with Sacl and EcoRI by using T4 DNA ligase. After transformation of an E. coli DH5a strain, a 
55 cDNA library having a selectivity for signal peptides was obtained. 
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Example 2 

Screening and analysis of cDNA coding for signal peptide 

About 1,200 colonies in the library obtained in Example 1 were divided into 24 pools (about 50 colonies/pool). 

5 Plasmids of each pools were isolated by the miniprep method and transfected into COS-7 cells by the DEAE- 
dextran method. After 48 to 72 hours, cell surface-staining for Tac of the transfectant was performed in the 
same manner as described in Reference Example 1 and 6 positive pools were selected under a fluorescent 
microscope. Colon ies of one pool from among the 6 posit ive pools were further divided and the same procedure 
as described above was repeated until a single clone was obtained. Thus a positive clone (pS-TT3) was ob- 

10 tained. Subsequently, by using two synthetic primers 



(20 bases downstream from Sacl cloning site, for antisense) 

20 which were specific for the pcDL-SRa-Tac vector, the nucleotide sequence of the TT3 insert was determined. 
An open reading frame following the Tac cDNAwith the deletion of the signal sequence in-frame was searched 
and converted into the deduced amino acid sequence. After performing a hydrophobicity profile, it was con- 
firmed that a hydrophobic region characteristic to a signal peptide was contained therein (Fig. 11). Further, 
the homology with data base on DNAand amino acid levels was examined. As a result, it has been found out 

25 that TT3 clone codes for an unknown protein. 

Example 3 

Screening of cDNA with the full length and determination of nucleotide sequence 
30 A cDNA library was constructed by using Super Script® Ramda System (marketed from BRL). Next, pS- 

TT3 was digested with Sacl and EcoRI and a TT3 cDNAf ragment was prepared by agarose gel electrophoresis. 
The library was screened by using an oligo-labeled TT3 cDNA fragment as a probe and thus a number of pos- 
itive clones were obtained. Among these clones, a TT3-1-6 clone showing the longest insert was selected and 
an Sall-Notl fragment excised from a ^gt22A vector was subcloned into pBS to thereby give a plasmid pBS- 
35 TT316. By using a T7 primer, the nucleotide sequence of 300 bp in the 5'-terminal of TT3-1-6 cDNA was de- 
termined. Thus it was confirmed that the sequence identical with TT3 of the probe existed in the most 5'-end 
of TT3-1-6. 

Next, a number of pBS-TT316 variant plasmids lacking of the 5-end or the 3'-end of TT3-1-6 cDNA were 
constructed by using an Exolll/Mung Bean Deletion Kit (manufactured by Stratagene). By using these variant 

40 plasmids, the nucleotide sequence of the full length of the cDNA was determined (sequence No. 3). From the 
full length cDNA sequence data, an open reading frame was determined and further translated into an amino 
acid sequence. Thus the sequence represented by Seq. ID No. 1 was obtained. The amino acid sequence at 
the 30- to 40-positions in the N-terminal of the amino acid sequence thus obtained were compared with known 
signal peptides. Thus the signal peptide part of this polypeptide was deduced to thereby give the sequence 

45 represented by Seq. ID No. 4. 



5' TTTACTTCTAGGCCTGTACG 3' 



15 



(20 bases upstream from EcoRI cloning site, for sense) 
and 



5' CCATGGCTTTGAATGTGGCG 3' 



50 



55 
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SEQUENCE LISTING 



GENERAL INFORMATION: 
APPLICANT: 

NAME: Ono Pharmaceutical Co., Ltd. 
STREET: 1-5 Doshomachi 2-chome 
CITY: Osaka 
COUNTRY: Japan 
POSTAL CODE (ZIP): 541 



TITLE OF THE INVENTION: 

Process for constructing cDNA library, and novel polypeptide and 

DNA coding for the same 
NUMBER OF SEQUENCES: 4 



30 SEQUENCE NO: 1 

LENGTH: 89 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
molecule TYPE: protein 
SEQUENCE: Seq. id No. l 



35 



40 



45 



50 



Met Asp Ala Lys Va 1 Val Ala Va 1 Leu Ala Leu Vdl Leu Ala Ala Leu 

1 ~ 10 15 

Cys He Ser Asp Gly Lys Pro Val Ser Leu Ser Tyr Arg Cys Pro Cys 

20 25 30 

Arg Phe Phe Glu Ser His He Ala Arg Ala Asn Val Lys His Leu Lys 

35 40 45 

He Leu Asn Thr Pro Asn Cys Ala Leu Gin He Val Ala Arg Leu Lys 

* 50 bb 60 

Asn Asn Asn Arg Gin Val Cys He Asp Pro Lys Leu Lys Trp He Glr. 

65 70 75 60 

Glu Tyr Leu Glu Lys Ala Leu Asn Lys 
8 5 



55 
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SEQUENCE NO: 2 

LENGTH: 267 base pairs 

TYPE: nucleic acid 

STRANDEDNESS: single 

TOPOLOGY: linear 

molecule TYPE: cDNA to mRNA 

SEQUENCE: Seq. ID No. 2 

ATGGACGCCA AGGTCGTCGC CGTGCTGGCC CTGGTGCTGG CCGCGCTCTG CATCAGTGAC 60 
GGTAAACCAG TCAGCCTGAG CTACCGATGC CCCTGCCGGT TCTTCGAGAG CCACATCGCC 120 
AGAGCCAACG TCAAGCATCT GAAAATCCTC AACACTCCAA ACTGTGCCCT TCAGATTGTT 180 
GCACGGCTGA AGAACAACAA CAGACAAGTG TGC ATTGACC CGAAATTAAA GTGGATCCAA 2 40 
GAGTACCTGG AGAAAGCTTT AAACAAG 2 67 



SEQUENCE NO: 3 
LENGTH: 1797 base pairs 
TYPE: nucleic acid 

30 

STRANDEDNESS: single 
TOPOLOGY: linear 
35 MOLECULE TYPE: cDNA to mRNA 
SEQUENCE: seq. id no. 3 

40 



50 



55 



GACCACTTTC 


CCTCTCGGTC 


CACCTCGGTG 


GCGCGCCCTC 


CCGCCCACGC 


CATGGACGCC 


GCCGCGCTCT 


GCATCAGTGA 


CGGTAAACCA 


TTCTTCGAGA 


GCCACATCGC 


CAGAGCCAAC 


AACTGTGCCC 


TTC AGATTGT 


TGCACGGCTG 


CCGAAATTAA 


AGTGGATCC A 


AGAGT ACCTG 


CCCAAAGGAC 


TTTCC AGTAG 


ACCCCCGAGG 


AGTGGTGGGG 


AGGAGGGCCT 


GAACCCTGGC 


GGTC ATGCT A 


AGGTTTGCC A 


GCATAAAGAG 


AoCTTATATT 


CATCCCTGGC 


CTCGCCCGTG 


CTAAGGAATT 


GTATTAOCCT 


AACCAGTTAG 



TCCTCTTTGCT GTCCAGCTCT GCAGCCTCCG 6C 

AAGGTCGTCG CCGTGCTGGC CCTGGTGCTG 12 C 

GTC A tCCTGA GCTACCGATG CCCCTGCCGG 18C 

GTCAAGCATC TGAAAATCCT CAACACTCCA 240 

AAGAACAACA ACAGACAAGT GTGCATTGAC 30 0 

GAGAAAGCTT TAAACAAGTA AGO ACAACAG 360 

AAGGCTGACA TCCGTGGGAG ATGCAAGGGC 420 

C AGGATGGCC GGCGGGACAG CACTGACTGG 4 B0 

ACTCCGCCAT AGC AT ATGGT ACGATATTGC 54 0 

CACAATGGAG CTTTTATAAC TGGGGTTTTT 600 

CTTCATCCCC ATTCTCCTGA TCCTCATCTT 6 60 
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5 



15 



20 



25 



CATTTTAAAA 


AGCAGTGATT 


ACTTCAAGGG 


CTGT ATTCAG 


TTTGCTTTGG 


AGCTTCTCTT 


720 


TGCCCTGGGG 


CCTCTGGGCA 


CAGTTATAGA 


CGGTGGJTTT 


GC AG G GAGCC 


CTAGAGAGAA 


760 


ACCTTCCACC 


AGAGC AGAGT 


CCGAGGAACG 


CTGCAG3GCT 


TGTCCTGC AG 


GGGGCGCTCC 


840 


TCGACAGATG 


CCTTGTCCTG 


AGTCAACACA 


AGATCCGGCA 


GAGGGAGGCT 


CCTTTATCCA 


900 


GTTCAGTGCC 


AGGGTCGGGA 


AGCTTCCTTT 


AGAAGTGATC 


CCTGAAGCTG 


TGCTC AG ASA 


960 


CCCTTTCCTA 


GCCGTTCCTG 


CTCTCTGCTT 


GCCTCCAAAC 


GCATGCTTC A 


TCTGACTTCC 


102 0 


GCTTCTCACC 


TCTGT AGCCT 


GACGGACCAA 


TGCTGCAATG 


GAAGGGAGGA 


GAGTGATGTG 


10R0 


GGGTGCCCCC 


TCCCTCTCTT 


CCCTTTGCTT 


TCCTCTCACT 


TGGGCCCTTT 


GTGAGATTTT 


1140 


TCTTTGGCCT 


CCTGTAGAAT 


GGAGCCAGAC 


CATCCTGGAT 


AATGTGAGAA 


CATGCCTAGA 


1200 


TTTACCCACA 


AAACACAAGT 


CTGAGAATT A 


ATCATAAACG 


GAAGTTTAAA 


TGAGGATTTG 


1260 


GACCTTGGTA 


ATTGTCCCTG 


AGTCCTATAT 


ATTTC AACAG 


TGGCTCTATG 


GGCTCTGATC 


132C 


GAATATCAGT 


GATGAAAATA 


AT A AT AAT AA 


TAATAATAAC 


GAATAAGCC A 


GAATCTTGCC 


138C 


ATGAAGCCAC 


AGTGGGGATT 


CTGGGTTCCA 


ATC AG AAATG 


GAGACAAGAT 


AAAACTTGCA 


144C 


tacatt:tta 


TGATCAC AGA 


CGGCCCTGGT 


GGTTTTTGGT 


AACTATTTAC 


AAGGCATTTT 


1500 


TTT ACATATA 


TTTTTGTGCA 


CTTTTTATGT 


TTCTTTGGAA 


GACAAATGTA 


TTTC AGAAT A 


156C 


TATTTGT AGT 


CAATTCATAT 


ATTTGAAGTG 


GAGCC AT AGT 


AATGCCAGT A 


GAT ATCTCTA 


162C 


TGATCTTGAG 


CTACTGGCAA 


CTTGTAAAGA 


AAT AT AT ATG 


ACATATAAAT 


GTATTGT AGC 


1680 


TTTCCG 3TGT 


CAGCC ACGGT 


GTATTTTTCC 


ACTTGGAATG 


AAATTGTATC 


AACTGTG AC A 


17 40 


TTATAT 3CAC 


TAGC AAT AAA 


ATGCT AATTG 


TTTC ATGCTG 


T AAAAAAAA A 


AAAAAAA 


1797 



30 

SEQUENCE NO: 4 
LENGTH: 1797 base pairs 
35 TYPE: nucleic acid 

STRANDEDNESS: single 
TOPOLOGY: linear 

40 

MOLECULE TYPE: cDNA to mRNA 
ORIGINAL SOURCE 
45 ORGANISM: Mouse 

CELL LINE: ST2 
FEATURE 

50 

NAME/KEY: CDS 
LOCATION: 82 .. 351 
IDENTIFICATION METHOD: P 



12 
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NAME/KEY: sig peptide 
LOCATION: 82 .. 138 
IDENTIFICATION METHOD: S 



TO 



15 



25 



NAME/KEY: mat peptide 

LOCATION: 139 . 348 
IDENTIFICATION METHOD: S 
SEQUENCE: Seq. ID No. 4 

GACC ACTTTC CCTCTCGGTC CACCTCGGTG TCCTCTTGCT GTCCAGCTCT GCAGCCTCCG 



60 



GCGCGCCCTC CCGCCCACGC C ATG GAC GCC AAG GTC GTC GCC GTG CTG GCC 111 

Met Asp Ala Lys Va 1 Va 1 Ala Val Leu Ala 
-15 -10 

CTG GTG CTG GCC GCG CTC TGC ATC AGT GAC GGT AAA CCA GTC AGC CTG 159 
Leu Val Leu Ala Ala Leu Cys lie Ser Asp Gly Lys Pro Val Ser Leu 

-5 1 5 

AGC T AC CGA TGC CCC TGC CGG TTC TTC GAG AGC CAC ATC GCC AGA GCC 207 
Ser Tyr Arg Cys Pro Cys Arg Phe Phe Glu Ser His lie Ala Arg Ala 
30 10 15 2 0 

AAC GTC AAG CAT CTG AAA ATC CTC AAC ACT CCA AAC TGT GCC CTT CAG 2 55 

Asn Val Lys His Leu Lys lie Leu Asn Thr Pro Asn Cys Ala Leu Gin 

25 30 30 

ATT GTT GCA CGG CTG AAG AAC AAC AAC AGA CAA GTG TGC ATT GAC CCG 30 3 

lie Val Ala Arg Leu Lys Asn Asn Asn Arg Gin Val Cys lie Asp Pro 
40 45 50 55 

40 AAA TTA AAG TGG ATC CAA GAG TAC CTG GAG AAA GCT TTA AAC AAG TAA 351 

Lys leu Lys Trp He Gin Glu Tyr Leu Glu Lys Ala Leu Asn Lys 
60 6 5 7 0 

45 GC AC AAC AGC CCAAAGGACT TTCC AGTAGA CCCCCGAGGA AGGCTGAC AT CCGTGGGAGA A 1 1 

TGCAAGGGCA GTGGTGGGGA GGAGGGCCTG AACCCTGGCC AGGATGGCCG GCGGGACAGC 47 1 
AC TGACTGGG GTC ATGCT AA GGTTTGCCAG CAT A A AG AC A CTCCGCCATA GC ATATGGTA 531 
CGATATTGCA GCTT AT ATTC ATCCCTGCCC TCGCCCGTGC ACAATGGAGC TTTT ATAACT 5 91 

50 

GGGGTTTTTC TAAGG AACTG TATTACCCTA ACC AGTTAGC TTCATCCCCA TTCTCCTC AT 651 
CCTCATCTTC ATTTTAAAAA GCAGTGATTA CTTCAAGGGC TGTATTCAGT TTGCTTTGGA 711 
G C T T C TC TTT ( ;C C C TGG ( ; C, C c TCT G ' 'M ^ Z AC AGTTATAGAC GGTGG< V T T TG C AGGG AG C CC 7 7 j 
55 TAGAGAGAAA COTTCC ACC A GAGCAGAGTC CGAGGAACGC TGC AGGGCTT GTCCTGCAGG 8 31 

GGGCGCTCC7 CGACYu <ATG< " ' "I'T( ;TC('T( , A < , T' " A AC A'" A A G ATCCGGC AG AGGGAGG CTC H <4 . 
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# 



CTTTATCC AG TTC A3TGCCA G3GTCGGGAA 

GCTCAGAGAC CCTTTCCTAG CCGTTCCTGC 

CTGACTTCCG CTTCTCACCT CTGTAGCCTG 

AGTGATGTGG GGTGCCCCCT CCCTCTCTTC 

TGAGATTTTT CTTTGGCCTC CTGTAGAATG 

ATGCCTAGAT TTACCCACAA AACAC AAGTC 

GAGGATTTGG ACCTTGGTAA TTGTCCCTGA 

GC7CTGATCG AATATCAGTG ATGAAAATAA 

AATCTTGCCA TGAAGCCACA GTGGGGATTC 

AAACTTGCAT ACATTCTTAT GATCACAGAC 

AGGCATTTTT TT AC A TAT AT TTTTGTGCAC 

TTCAGAATAT ATTTGTAGTC AATTC AT ATA 

ATATCTCT AT GATCTTGAGC TACTGGZAAC 

TATTG TAGCT TTCCGGTGTC AGCCACGGTG 

ACTGTGAC AT T AT ATGC ACT AGCAATAAAA 
AAAAAA 



GCTTCCTTTA 


GAAGTGATCC 


CTGAAGCTGT 


95 ; 


TCTCTGCTTG 


CCTCCAAACG 


CATGCTTC AT 


1011 


ACGGACCAAT 


GCTGC AATGG 


AAGGGAGGAG 


1 07 1 


CCTTTGCTTT 


CCTCTCACTT 


GGGCCCTTTG 


1131 


GAGCCAGACC 


ATCCTGGAT A 


ATGTGAGAAC 


1191 


TGAGAATTAA 


TC ATAAACGG 


AAGTTTAAAT 


1251 


GTCCT ATATA 


TTTCAACAGT 


GGCTCTATGG 


1311 


TAATAATAAT 


AATAATAACG 


AATAAGCCAG 


1371 


TGGGTTCCAA 


TCAGAAATGG 


AGACAAGATA 


1431 


GGCCCTGGTG 


GTTTTTGGTA 


ACT ATT TAG A 


1491 


TTTTT ATGTT 


TCTTTGGAAG 


ACAAATGT AT 


15 51 


TTTGAAGTGG 


AGCCATAGTA 


ATGCCAGTAG 


'611 


TTGTAAAGAA 


ATATATATGA 


CATATAAATG 


1671 


TAT TTTTCC A 


CTTGGAAT3A 


AATTGTATCA 


1731 


TGCT AATTGT 


TTCATGCTGT 


A AAAAAA AAA 


17 9 1 
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Claims 



30 1. A process for constructing a cDNA library comprising; 

(1) synthesizing single-stranded DNA complementary to mRNA isolated from subject cells using a ran- 
dom polymer and ligating oligo dT to the 3'-end of the single-stranded DNA; 

(2) converting the single-stranded DNA obtained in (1) to a double-stranded DNA using as primer a 
poly A oligomer tigated to a first restriction enzyme (enzyme I) site; 

35 (3) fragmenting the double-stranded DNAobtained in (2), fractionating the fragments obtained by size, 

ligating linker containing a second restriction enzyme (enzyme II) site, differing from the enzyme I, 
thereto and fractionating; 

(4) amplifying fragments containing the sites of restriction enzymes I and II by a polymerase chain re- 
action using a first primer containing the enzyme I site and a second primer containing the enzyme II 

•«? site, digesting the cDNA thus amplified with the restriction enzymes, enzyme I and enzyme II and frac- 

tionating; and 

(5) ligating the cDNAf ragment thus obtained upstream of a gene coding for a secretory protein or mem- 
brane protein having deleted therefrom the sequence coding for a signal peptide, integrating the ligated 
DNA into a eucaryotic cell expression plasmid vector, and transforming the vector into a host. 



45 



55 



2. A process according to claim 1. wherein EcoRI is used as the enzyme I. Sacl is used as the enzyme II, 
and human IL-2 receptor « gene is used as a known secretory protein or membrane protein gene with 
the deletion of signal peptide. 

3. A polypeptide having the amino acid sequence shown in SEQ ID No. 1 in substantially purified form, a 
homologue thereof or a fragment of the sequence or homologue of a fragment. 

4. A polypeptide according to claim 1 having the amino acid sequence shown in SEQ ID No. 1. 

5. A polypeptide comprising a polypeptide according to claim 3 or 4, having the amino acid sequence of re- 
sidues 1 to 70 shown in SEQ ID No. 4 in substantially purified form, a homologue thereof, or a fragment 
of the sequence or homologue of a fragment. 
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6. A polypeptide according to claim 5 having the amino acid sequence shown of residues 1 to 70 shown in 
SEQ ID No. 4. 

7. DNA encoding a polypeptide according to any one of the preceding claims. 

5 8. DNA according to claim 7 having the nucleotide sequence shown in SEQ ID No. 2 or a fragment thereof 
capable of selectively hybridizing to SEQ ID No. 2. 

9. DNA according to claim 7 having the nucleotide sequence shown in SEQ ID No. 3 or a fragment thereof 
capable of selectively hybridizing to SEQ ID No. 3. 

10 

10. DNA according to claim 7 having the nucleotide sequence of residues 139 to 348 shown in SEQ ID No. 
4, or a fragment thereof capable of selectively hybridizing to the nucleotide sequence of residues 139 to 
348 shown in SEQ ID No. 4. 

15 11. DNA according to claim 10 having the nucleotide sequence of residues 139 to 348 shown in SEQ ID No. 
4. 

12. A replication and expression vector comprising DNA according to any one of claims 7 to 11 . 
20 13. Host cells transformed or transfected with a replication and expression vector according to claim 12. 

14. A method of producing a polypeptide which comprises culturing host cells according to claim 13 under 
conditions effective to express a polypeptide according to any one of claims 3 to 6. 

15. A monoclonal or polyclonal antibody to a polypeptide according to any one of claims 3 to 6. 

25 

16. A pharmaceutical composition containing a polypeptide according to any one of claims 3 to 6 or an anti- 
body according to claim 15 in association with a pharmaceutically acceptable diluent or carrier. 



30 



35 



40 



45 



50 
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Figure 2 
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Figure 3 
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Figure 4 
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Figure 6 
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Figure 7 




Fluorescense 1 



22 



EP 0 607 054 A2 



Figure 8 
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Figure 9 
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Figure 10 
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Figure 11 
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